Picture for Jiaqi Huang

Jiaqi Huang

Theoretical Analysis of Engression and Reverse Markov Engression

Add code
May 31, 2026
Viaarxiv icon

On-Policy Replay for Continual Supervised Fine-Tuning

Add code
May 28, 2026
Viaarxiv icon

Divide-and-Conquer Inference for Large-Scale Visual Recognition with Multimodal Large Language Models

Add code
May 24, 2026
Viaarxiv icon

EagleVision: A Multi-Task Benchmark for Cross-Domain Perception in High-Speed Autonomous Racing

Add code
Apr 13, 2026
Viaarxiv icon

Implicit Strategic Optimization: Rethinking Long-Horizon Decision-Making in Adversarial Poker Environments

Add code
Feb 08, 2026
Viaarxiv icon

ECHO-2: A Large-Scale Distributed Rollout Framework for Cost-Efficient Reinforcement Learning

Add code
Feb 03, 2026
Viaarxiv icon

RelayGR: Scaling Long-Sequence Generative Recommendation via Cross-Stage Relay-Race Inference

Add code
Jan 05, 2026
Viaarxiv icon

P/D-Device: Disaggregated Large Language Model between Cloud and Devices

Add code
Aug 12, 2025
Viaarxiv icon

SAM-R1: Leveraging SAM for Reward Feedback in Multimodal Segmentation via Reinforcement Learning

Add code
May 28, 2025
Viaarxiv icon

Separate to Collaborate: Dual-Stream Diffusion Model for Coordinated Piano Hand Motion Synthesis

Add code
Apr 14, 2025
Viaarxiv icon